Compensation of noise effects for robust speech recognition in car environments
نویسندگان
چکیده
In this paper, we propose a novel method to compensate the effect of the noise for Automatic Speech Recognition in car environments. This method can be applied to recognizers using a standard MFCC front-end. We perform a channel-by-channel compensation of the noise effect in the filter-bank output domain. In a first stage, the parameters describing the noise are estimated and secondly, we estimate the expected value of the clean speech in a probabilistic framework. The compensated filter-bank outputs are then used to obtain a compensated version of the MFCC-based parameters representing the speech signal. Recognition experiments using the French VODIS database (recorded in several cars running in real traffic situations) have been carried out to test the proposed compensation method. The results show the capability of the proposed method for the compensation of the noise effect in car environments.
منابع مشابه
Robust automatic speech recognition for accented Mandarin in car environments
This paper addresses the issues of robust automatic speech recognition (ASR) for accented Mandarin in car environments. A robust front-end is proposed, which adopts a Minimum Mean-Square Error (MMSE) estimator to suppress the background noise in frequency domain, and then implements spectrum smoothing both in time and frequency index to compensate those spectrum components distorted by the nois...
متن کاملروشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه
Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...
متن کاملOn the comparison of front-ends for robust speech recognition in car environments
In this paper we compare several front-ends for Automatic Speech Recognition systems operating under noise conditions. The analyzed front-ends are based on standard MFCC parameterizations and include methods to compensate the effect of the noise over the representation of the speech signal. Three different compensation methods are considered in this work: Cepstral Mean Normalization, Spectral S...
متن کاملImproving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کاملStatistical Adaptation of Acoustic M for Robust Speech Re
Noise degrades the performance of Automatic Speech Recognition (ASR) systems working in real condition. The mismatch between the training and recognition conditions is considered the main factor involved in this degradation, and most methods for robust ASR are focussed on its minimization. In this work, we compare robust methods for ASR based on (a) the compensation of the noise effects and (b)...
متن کامل